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W LOAD BALANCING 

FIELD OF THE INVENTION 

omD uterne W or kSl n general, and in particular to load 

BACKGROUND OF THE INVENTION 

,•„„ a server from becorrung 
u <nch as the Internet, preventing a serv 
In computer networks, such having 

k p accomplished by providing several 

redundant capabilities and managmg the dtsmb 

■none early impiementat.on of, . . severa ,n. messes for a Si n g >edo m ai„ n a m e, 

wit h each address correspond to one ^ ^ ^ 

. ,,,„., translation and responds by return, s 
server receives a request for address ^ DNS ^ 

. „Mress at random to connect to. ™ 

round . ro „„ fasWo , or activeiy monitors 

More recent,, a de.ce ^n as a ^ ^ ^ ^ _ , M ds as fo»o», 

Th e toad balancer is provided as a gateway ,0 ^ ^ ^ ^ ^ ^ ^ ^ the 
geographical location and referred to as a server arm ^ ^ ^ ^ ^ 

— - - — address ,s ref :; d ^ « — - - *- - 

f the servers to which it is connected. Client 
addresses of the servers 


--. hased o„ server load and 

il,,>h then sends the request to a s» based 
address of the ioad balanc»»ch then 

avai.abiiity or using other tnown techniques. ^ ^ ^ ^ 

-a— rverfarrnroasecondo ^ ^ ^ — used invoWes 

^er^areav^etotendtore 

^ding an HTTP redirect message - ^ ^ fe fc ^ 

- - • rwou,e the : i > — — * - - - * mp,oyed in 

m essage. This method o f - - « ^ ^ „ ^ .other 

response to HTTP requests, and not , a „ cer t0 act asaDNS server. Upon receiving 

teroutingmethodinvoives—ngthefir, 

a D NS request, the nrs, ioad baiancer simp,, returns ^ ^ ^ 

i response to «- — -re there J ^ ^ ^ ^ request5 to 

:.I tatermediateDHSserversmayresuUrnaprevrousiy 

^graphicaiiocationorachentmaybecons ^ load bal anci„g techniques. 

* the eeoeraphically nearest server, iu 
^tnotnecessa.vprovidethechentw.th.he.s ^ ^ ^ „ 


pro vide more processing capacity at the server. 

SUMMARY OF THE INVENTION 


Tn e present inSn — » — — • — * " " ^ 

There is thus provided in accordance with a preferred embodnnen, - - *~ 

* on a network the method including receiving a 
— a — *r ~ — • «- ^ : . fot load „ ha.n g a nrs, 

- — — addr : the ::; : : ^ — — — - 

request from the hrst 10 destination address 

- - - — . rnz— — - 

rrrr 

the tirst iod indicating the first 

— rrrn m — -- — 

load balancer network address associate 

Further in accordance with a preterrea em 

— - — — - — — — — " "* 

,oad balancer network address a. the second load balancer, 

tat ,oad balance,. , h a pref erred embodiment of , he present Invent the 

Additionally in accordance w«h a preferred 

, , „ the second load balancer to a server in commumcauon 
^odinaudesdirectin.thercuestfrom.heseco 

„„„ the second load balancer, composing the response a, the server, 
the second load balancer. 
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^* J - tU „ nre ^ftembodiment of the present 

There is al»vided in accordance with a pre^e 

♦ ,«rV the method including determining the 
mention a method for load balancing requests on a network, the metho 

* the reauestor to the closest load balancer, 
quests from the reque ^ ^ ^ ^ ^ 

Further in accordance with a preferred 

— — 7" ^ a prrferred embodimen , or the present — the 

Still further in accordance with a preterre 

, „H of each of the load balancers, and performing the 

5 Moreover in accordance with a preierrc 

I dete^nsstepincMesde— a, at leas, one f,*ed time. 

FutthM ,„ accordance with a pitted en,—, of the Ptesen, 

• w «t least two attributes selected from the 
determining step .eludes pol,in g the requestor to yeld at 

i »• oTTT and number of hops to requestor. 
groupconsistinsoflatencrdanveTTUandnu 

♦v, * nreferred embodiment of the present inven 
Still further in accordance with a preferred 

•no at least two polling methods selected from the 

— - — - : . - — - — - 

i • „ Tr p acK message to the request 
port, sendtng a TCP ACK m ^chaWe" reply, 

request to a sufficiently high port number as ,0 ehc. an ICMP port 
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Additionally ij(Jbrdance with a preferred embodim^^f the present invention the 
designating step includes designating a closest one of the load balancers by ranking the load 
balancers by network proximity and either of current load and available capacity. 

There is also provided in accordance with a preferred embodiment of the present 
invention a method for determining network proximity, the method including sending from each of at 
least two servers a UDP request having a starting TTL value to a client at a sufficiently high port 
number as to elicit an "ICMP port unreachable" reply message to at least one determining one of the 
servers indicating the UDP request's TTL value on arrival at the client, determining a number of 
hops from each of the servers to the client by subtracting the starting TTL value from the TTL value 
on arrival for each of the servers, and determining which of the servers has fewer hops of the client, 
and designating the server having fewer hops as being closer to the client than the other of the 


servers. 


There is additionally provided in accordance with a preferred embodiment of the present 
invention a network load balancing system including a network, a first load balancer connected to the 
network and having a first load balancer network address, a second load balancer connected to the 
network and having a triangulation network address, the triangulation network address being 
associated with the first load balancer network address, and a requestor connected to the network 
and having a requestor network address, where the requestor is operative to send a request via the 
network to the first load balancer, the request having a source address indicating the requestor 
network address and a destination address indicating the first load balancer network address, the first 
load balancer is operative to forward the request to the second load balancer at the triangulation 
network address, the request source address indicating the requestor network address and the 
destination address indicating the triangulation network address, and the second load balancer - is 
operative to send a response to the requestor at the requestor network address, the response having 
a source address indicating the first load balancer network address associated with the triangulation 
network address and a destination address indicating the first requestor network address. 


Further in ac^ftnce with a preferred embodiment o^J^resent invention either of the 
load balancers is operative to maintain a table of the association between the triangulation network 
address and the first load balancer network address. 

Still further in accordance with a preferred embodiment of the present invention the 
second load balancer is operative to maintain a table of the association between the triangulation 
network address and the first load balancer network address and communicate the association to the 
first load balancer. 

Additionally in accordance with a preferred embodiment of the present invention the 
system further includes a server in communication with the second load balancer, where the second 
load balancer is operative to direct the request from the second, load balancer to the server, and the 
server is operative to compose the response and provide the response to the second load balancer. 

There is also provided in accordance with a preferred embodiment of the present 
invention a network load balancing system including a network, at least two load balancers 
connected to the network, and a requestor connected to the network, where each of the at least two 
load balancers is operative to determine the network proximity of the requestor, and at least one of 
the load balancers is operative to designate a closest one of the load balancers by ranking the load 
balancers by network proximity and direct requests from either of the requestor and a subnet of the 

requestor to the closest load balancer. 

Further in accordance with a preferred embodiment of the present invention the load 
balancers are operative to poll the requestor to yield at least two attributes selected from the group 
consisting of: latency, relative TTL, and number of hops to requestor. 

Still further in accordance with a preferred embodiment of the present invention the load 
balancers are operative to poll the requestor using at least two polling methods selected from the 
group consisting of: pinging, sending a TCP ACK message to the requestor's source address and 
port, sending a TCP ACK message to the requestor's source address and port 80, and sending a 
UDP request to a sufficiently high port number as to elicit an "ICMP port unreachable" reply. 
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Additionally inl^rdance with a preferred embodimen^he present invention at least 
one of the load balancers is operative to designate the closest one of the load balancers by ranking 
the load balancers by network proximity and either of current load and available capacity. 

It is noted that throughout the specification and claims the term "network proximity- 
refers to the quality of the relationship between a client and a first server or server farm as compared 
with the relationship between the client and a second server or server farm when collectively 
considering multiple measurable factors such as latency, hops, and server processing capacity. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The present invention will be understood and appreciated from the following detailed 
description, taken in conjunction with the drawings in which: 

Figs. 1 A - 1C, taken together, are simplified pictorial flow illustrations of a triangulation 
load balancing system constructed and operative in accordance with a preferred embodiment of the 

X, 

present invention; and 

Figs. 2A - 2F, taken together, are simplified pictorial flow illustrations of a network 
proximity load balancing system constructed and operative in accordance with another preferred 
embodiment of the present invention. 

DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS 
Reference is now made to Figs. 1A - 1C which, taken together, are simplified pictorial 
flow illustrations of a triangulation load balancing system constructed and operative in accordance 
with a preferred embodiment of the present invention. Two server farms, generally designated 10 
and 12 respectively, are shown connected to a network 14, such as the Internet, although it is 
appreciated that more than two server farms may be provided. Server farms 10 and 12 typically 
comprise a load balancer 16 and 18 respectively, which may be a dedicated load balancer or a server 
or router configured to operate as a load balancer, with each of the load balancers being connected 
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to one or more servers 20. 41 balancers 16 and IS are aUernativSferred «o herein as LB1 and 
LB 2 respectively. LB, and LB2 typically maintain a server status table 22 and 24 respectively, 
indicting the current load, configuration, availability, and other server information as is common to 
load baUncers. LB. and LB2 a,so typically periodically receive and maintain each other's overaU 
status and load statistics such that LB1 and LB2 can know each other's availability. 

Typical operation of the triangu.ation load balancing system of Figs. 1 A - 1C is now 
described by way of example. As is shown more particularly with reference to Fig. IA. a Cent 26. 
as any known computer terminal configured for communication via network .4, is shown 

,00 .00 , 0. in accordance with network transmission protocols, request 2S indicates the source IP 
address of the requestor, being the IP address ,97, 33, of client 26, and the destination IP addr,s S , 
oeing the virtual IP address ,00.100.1.0 of LB1. LB2 preferably periodic* sends a status report 30 
to LB1, the virtual IP address ,00,00,.0 of LB, being known in advance to LB2. Status report 30 

fem „ - nf i nrovides load statistics, which LB1 
typically indicates the availability of server farm 12 and provides 

maintains. 

LB 2 is preferably capable of having muHiple virtua, IP addresses as is wett known. I. is a 

such as 200,00,,, for LBVs use and store the mapping between the IP address of LB, and the 
designated IP address in a Ration mapping tab.e 32, as is shown more particular* wtth 
reference to Fig. IB. The designated address is referred ,0 herein as the ^angulation address and 
may „e preconfigured with LB1 or periodically provided ,0 LB, fiom LB2. LB, preferab,, 

• „r,k. n> .Hdress 197 1 33.5 of client 26 and the 
maintains in a client mapping table 36 a mapping of the IP address 19/. 

triangulation address 200,00,, of LB2 to which client 26's requests maybe redirected. 

avails. Having decided that client 26's reaues, should be forward*, to LB2, in Fig. ,C LB, 


is 
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substi ,u.es .he desUnauon^dress of reoues. 2S wi.h ,he vinua^address 200,00,, of LB2 

ad dres,modif,ed cue. re q ues. 3S ,o LB2. LB2, upon receiving re q ues, 3S a. i,s vim*, IP address 
200 ,00 , 1 checks ,ria»gu,a«ion mapping .able 32 and finds .ha, vir.ua, TP address 200 ,00,., has 
heen desired for LBl's use. LB2 .herefore uses the vir.ua, IP address ,00,OO,.O of LB, as per 

LB , » is apprecia.ed «ha. response 40 mus. appear «o Cien. 26 .o come from IB., otherwise Cien, 

r - + a ~o^t Client 26 may continue to send requests 
26 will simply ignore response 40 as an unsolved packet Chen, 26 y 

t0 lb, which LB, .hen towards recasts „ LB2 a. .he designed triangulation address. LB2 

IP address. 

Reference is now made .o Figs. 2A - 2F which, taken together, are simplified pictona, 
fl ow illusions of a network proximity load balancing sys,em cons.ruc.ed and operative m 
accordance wi.h ancher preferred embodiment of .he presen. invention. The configuration of .he 

, . ,„ i A - IC except as otherwise described 

system of Figs. 2A - 2F is substanmlly snrnlar to F,gs. 1 A K. P 

hereinbelow. For Ulus.ra.ion purposes, a .hird server farm, generauy desired SO, is shown 
connected .o ne.work ,4, ahhough i,is apprecia,ed tha, two or more server farms may beprovided. 

halancer with load balancer 52 being connected to 
server or router configured to operate as a load balancer, wren 

two or more servers 20. Load baiancer 52 is alternatively referred to herein as LB3. 

Typical operation of the network proximity load balancing system of Figs. 2A - 2F ,s 
now described by way of example. As is shown more par,icu,ar,y wi.h reference .o Hg. 2A, chen. 
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farm site or sites to whieWHquests from a particular subnet shlK be routed. Determining the 
"best" site is described in greater detail hereinbelow. 

Upon receiving a request, LB1 may decide to service the request or not based on normal 
load balancing considerations. In any case, LB1 may check proximity table 54 for an entry indicating 
the subnet corresponding to the subnet of the source IP address of the incoming request. As is 
shown more particularly with reference to Fig. 2B, if no corresponding entry is found in proximity 
table 54, LB1 may send a proximity request 56 to LB2, and LB3, whose virtual IP addresses are 
known in advance to LB1. Proximity request 56 indicates the IP address of client 26. 

A "network proximity" may be determined for a requestor such as client 26 with respect 
to each load balancer/server farm by measuring and collectively considering various attributes of the 
relationship such as latency, hops between client 26 and each server farm, and the processing 
capacity and quality of each server farm site. To determine comparative network proximity, LB1, 
LB2, and LB3 preferably each send a polling request 58 to client 26 using known polling 
mechanisms. While known polling mechanisms included pinging client 26, sending a TCP ACK 
message to client 26 may be used where pinging would otherwise fail due to an intervening firewall 
or NAT device filtering out a polling message. A TCP ACK may be sent to the client's source IP 
address and port. If the client's request was via a UDP connection, a TCP ACK to the client's 
source IP address and port 80 may be used. One or both TCP ACK messages should bypass any 
intervening NAT or firewall and cause client 26 to send a TCP RST message, which may be used to 
determine both latency and TTL. While TTL does not necessarily indicate the number of hops from 
the client to the load balancer, comparing TTL values from LB1, LB2, and LB3 should indicate 
whether it took relatively more or less hops. 

Another polling method involves sending a UDP request to a relatively high port number 
at the client, such as 2090. This request would typically be answered with an "ICMP port 
unreachable" reply which would indicate the TTL value of the UDP request on arrival at the client. 
Since the starting TTL value of each outgoing UDP request is known, the actual number of hops to 
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„ ■ msub„ac. i ng. h eTTLva.ueonaMv#.h=cUen,from*es.aning 

thA client may be determine^J subtracting 

f • a TCP ACK UDP, and other polling techniques may be used 
TTL value. A combination of pmgmg, TCP ACK, u 

since any one polling request might fail. 

• 9D sending a polling response 60 to the vano f 

client 2* , .own m » sen, ^ ^ ^ ^ ^ 

quests. The responses may * used «o determ.ne the iatency ^ 
, LB2 andl.B 3 .hensendponin g resu 1 .s62«oLB 1 ,ass h own,nF, g .2E. 

m d d LB1 LB2 and LB3 ranked, such as by we ig h.in g each attribute 

resul ,s may then be compare,, and U». 

„ determine a - we, g h,ed - - - server - **. 

firm capacity and availability, such as may be requested 
tog e,ner with server farm capac y ^ ^ ^ ^ , A 

— " ba,anci„ g repo.n g — - _ ^ ^ ^ ^ fc ^ 

-n ,.a— — < ^^.tbeCosestsitemaybe 

— ^ 1 lM d, and capac, resu,.. may 

th a, which has the lowesOotal wetghted vaiue for ah pol 

^.tworicpro^shouidaioadrepor.rece.vedfrom.hebe 

«, In addition, the best network prox.rn.ty to a parucular 
— "7" h ledtimesoraneraprede—amountonrmehaseiapsed 
periodicaUy redetermined, such as at fixed 
^^etimetheiastdetenninationwasmade. 

M is shown more particuiariy with reference to F,g.2F,o 
" . ed Cien. 26 may be redirected ,0 the Coses, site usin g various method, 
26 has been determined, client 10 may 
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„ ithitt address. If an HTTP request 
the present ta ven ti on >— ^ 

— — : : ^ . — - 

ir« 1A - 1C and 2A - 2F 
techniques. Ascribed with reference to Figs. 

It , appreciated that the steps descnh ^ ^ ^ ifl fect 

^ in the order shown unless om 
„eed no. ne«ssanW b e performed 

1 Converse.,, various feature* of wbc o m taa..on. 
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